Network Growth with Preferential Attachment for High Indegree and Low Outdegree 
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We study the growth of a directed transportation network, such as a food web, in which links 
carry resources. We propose a growth process in which new nodes (or species) preferentially attach 
to existing nodes with high indegree (in food-web language, number of prey) and low outdegree 
(or number of predators). This scheme, which we call inverse preferential attachment, is intended 
to maximize the amount of resources available to each new node. We show that the outdegree 
(predator) distribution decays at least exponentially fast for large outdegree and is continuously 
tunable between an exponential distribution and a delta function. The indegree (prey) distribution 
is poissonian in the large-network limit. 



I. INTRODUCTION 



Directed networks that transport a resource, such as energy, from one or several sources to a large number of 
consumers are important in many areas of science Among such networks, food webs provide an example of great 
interest, both from a purely scientific point of view, and because of their importance for nature-conservation efforts 
• Although knowledge is rapidly accumulating about the structure of food webs 0, 0, 0, B El 0] > and static models 
have been developed to describe some of their statistical properties 0, 0, HI, [1] , we are as yet only beginning to develop 
y ' an understanding of the processes by which such networks are formed and evolve under the influence of speciation, 
invasion, and extinction of interacting species (oL ITol. fTTI. [l2l [T3. fl~ij . 

An important aspect of the network structure of food webs is that they have de gree distributions that generally 
decay quite fast with increasing degree - in most cases at least exponentially 0, 0, \§l Il3ll. This is in sharp contrast to 
the class of networks known as scale-free, which have power-law degree distributions |l5l. [l6j . While there has been 
a veritable explosion of research on scale-free networks, there has been no similar surge of interest in networks with 
rapidly convergent degree distributions. Most food webs, and some (but not all [17]) transportation networks, such as 
the North American power grid [Lsl ]. and the European railway network [l9j belong to this class. Much work remains 
, to be done before a comprehensive understanding of the mechanisms by which such networks evolve is reached. 
^ ■ As a step toward the development of such an understanding, we here propose a network growth scheme that produces 
<^> a poissonian indegree distribution (in food-web language: prey distribution) and an outdegree (predator) distribution 
C*~) . that is continuously tunable between an exponential distribution and a delta function. We note that these degree 
distributions do not agree with current food-web theory. In particular, the indegree distribution produced by the niche 
model Q has an exponential tail Q. It has been claimed that models with an exponentially decaying probability of 
preying on a given fraction of species with lower or equal niche values are capable of producing food webs that are 
structurally in agreement with empirical data [||. However, it is not clear why this condition is necessary or why 
schemes that invoke no physical mechanisms (as in the niche model) are able to produce such webs. Therefore, other 
plausible schemes should also be explored. 

Our model employs a scheme in which new nodes (species) attach to existing nodes with a preference for nodes i 
with high indegree k[ and low outdegree ki. In food- web terms, this corresponds to a prospective predator choosing 
prey that have a large number of resources (represented by the large indegree) , while the competition from previously 
established predators should be as small as possible (low outdegree). Among the influences on the network growth 
process mentioned above (speciation, invasion, and extinction), we have thus chosen to focus on invasion and/or 
speciation. By ignoring extinction, we essentially model the early phase of steady network growth. The proposed 
growth process corresponds to a probability of attachment, 

n$,fci)oc (l) 
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with 7 > 0. This attachment scheme is the direct opposite of the "rich get richer" scheme of preferential attachment 
with an attachment probability proportional to the total degree /, = k[ + ki, which is known to produce scale- free 
networks and has been studied in a myriad of variations over the last decade [lU To emphasize this difference, 
we shall call the scheme proposed here inverse preferential attachment. [We note that nonlinear forms of the "rich 
get richer" scheme with a probability of attachment proportional to l°_ with a > have also been studied. However, 
except for a = 1 , no a > leads to a power-law degree distribution [20| .] 

In a previous paper [2l| , we studied a simplified version of the scheme proposed here, in which a new node makes a 
constant number of incoming links (k[ = const.), and 7 = 1. In this simplified version, the probability of attachment, 
H(ki) oc 1/ki, depends only on the outdegree (the number of predators). We calculated the outdegree distribution for 
this simplified model both analytically and by Monte Carlo simulations. It is given by the self-consistent equation 

n* k = (k + l)(m/4) fe r( '+^ / f ) , (2) 
T(k + 2 + m/z* m ) 

where 
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(3) 



and r(a;) represents the Gamma function. 



II. MODEL AND RESULTS 



In the present paper, we investigate the general form of the attachment probability presented in Eq. ([I]). With this 
form we relax both of the restrictions of the simplified model: we do not fix the indegree (number of prey) for the new 
nodes, so that each can make a different number of links, and we also vary the exponent 7. This makes full analytical 
treatment much harder, and the results for the outdegree distribution presented here are therefore only numerical. 
The indegree distribution, however, is found analytically to be a poissonian in the large-network limit. 

The generalized attachment process proceeds as follows. We start the growth process with N nodes and assign 
each initial node an indegree m < Nq. These nodes act as source nodes as they are not connected to any other node 
at the beginning. Actually, the attributes of the initial nodes have no significance for the statistics because the final 
size of the network, A max + No, is much larger than No- In each time step, we add a new isolated node. Then, we 
give the new node m chances to establish a link to an existing node with probability 

5 (iE^r) 7 ' (4) 

where 

with 7 > 0. Here, N denotes the number of existing nodes at that time step. We use ki + 1 in the denominator to 
prevent a divergence for ki = 0. Multiple links between two nodes are not allowed. The direction of a link is from the 
old node to the new one. 

We implement the growth process in Monte Carlo simulations as follows. We seed the system with Nq source nodes, 
each with indegree to, and introduce a new node in each Monte Carlo step. To create links between the new node and 
the existing ones, we pick existing nodes, i, one by one and calculate the probabilities of attachment, n(fc^, ki). Then, 
we generate a random number, r, and attach the new node to node i if r < H. We repeat this procedure until all 
existing nodes in the network are tested, i.e., till the sweep is completed. Since '^2 i n(/c J ', ki) = 1, a new node makes 
on average one connection per sweep. We sweep the whole network to times, so that (fc,-) = (ki) = to. The new node 
is kept in the system, even if it does not acquire any links. However, a node with k' = stays isolated throughout 
the growth since the probability of attachment to it is zero. We stop the growth when the network size N reaches 
Amax + N nodes with N max = 10 5 . We average over fifteen independent runs for each value of to and 7. 

We first tested the case of 7 = 1 to compare the outdegree distribution of the full k'/k model to the outdegree 
distribution of our simplified l/k model, Eqs. (J2IS1), which, for large k, decays like kn k /T(k), where /j, is a constant. 
As seen in Fig. [TJ the outdegree distribution for the k'/k model also decays faster than exponentially for large k. 
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FIG. 1: Outdegree distributions for 7 = 1 and m = 1, 3, and 5 with No = 10 shown on linear (a) and log-linear (b) scales. The 
simulations were stopped when the network size reached No + 10 nodes. Each curve (with symbols) represents an average over 
fifteen runs. The curves without symbols are the theoretical outdegree distributions for our simplified model with Tl(k) oc 1/k. 
Both the k' /k and 1/k models yield the same distribution for m 3> 1. As A: is a discrete variable, lines connecting the symbols 
are merely guides to the eye. 



However, the dependence on the variable indegree leads to a broadening of the outdegree distribution: decreased 
probabilities for k w m, and compensating increased probabilities for k 3> m and k <c m. In the limit of large m, the 
central part of the outdegree distribution of the k'/k model approaches that of the 1/k model. 

The outdegree distribution of the general model also varies with 7. Higher values of 7 sharpen the peak of the 
distribution around the mean outdegree, m, as it increases the tendency of the new nodes to prefer existing nodes 
with a higher value of k'/k (Fig. [5]). In the limit 7 — > 00 one should obtain a delta function at k = m. Similarly, 
lower values of 7 relax the constraint and flatten the outdegree distribution. The limiting case, 7 = 0, corresponds to 
growth without preferential attachment, which yields an exponential outdegree distribution of mean m fl6L [2lj] . The 
lines connecting the symbols are guides to the eye. 

In contrast to the outdegree distribution, the indegree distribution of the generalized model in the N ^> m,k' limit 
can be described analytically and is extremely well approximated by a Poisson distribution with mean m, independent 
of 7 (Fig. [3]). This can be shown as follows. Each new node makes one link per sweep on average. When N ^> to, 
each existing node has a probability equal to l/N of acquiring a new link per sweep on average, independent of the 
history of the network. Therefore, the probability of acquiring k' links (after m sweeps) for the node added at time 
t, when the total number of nodes is N, is a binomial, 

P k '(N)=^p k '(l- P ) N - k ' (6) 
with p = m/N . Thus, the final indegree distribution is an average (ignoring the iVo initial nodes), 

n * = w— E Pk, w- w 

JVmax N=N 
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FIG. 2: Outdegree distributions for 7 = 0.5, 1, 2, and 3, with network size iVo + 10 nodes, and m = 5. Each curve is averaged 
over fifteen runs. Inset: The same distributions shown on a log- linear scale. The lines connecting the symbols are guides to the 
eye. 
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FIG. 3: Indegree distributions for 7=1 and m = 1, 3, and 5 with No = 10. The simulations were stopped when the network 
size reached No + 10 nodes. Each curve represents an average over fifteen runs. The error bars are smaller than the symbol 
sizes. Inset: The same distributions shown on a log-linear scale. The symbols *, x, and + show the Poisson distribution, Eq. 
l|8|l. for m = 1, 3, and 5, respectively. The lines connecting the symbols are guides to the eye. See text for details. 



In general, this cannot be calculated exactly. However, for N 3> m, k' , Pk>(N) can be approximated by a poissonian 
of mean m [22| , 

m k ' exp(-m) 

Pk ' m ' (8) 

which is independent of N. The convergence with N to this result is fast, so that for 7V max 3> No,m, the sum in 
Eq. ||7j) is dominated by the TV-independent terms. As a result, 

<, « ^^"^ (9) 

is an excellent approximation, as shown in Fig. [3J Computer simulations confirm the 7-independent form of the 
indegree distribution, as shown in Fig. [4] 



III. DISCUSSION 



The mechanism that we propose and study in this paper, growth by preference for high indegree (number of prey) 
and low outdegree (number of predators), is intended to simulate the early stages of the development of food webs 
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FIG. 4: Indegree distributions for 7 = 0.5, 1, 2, and 3 with m — 5, and network size No + 10 5 nodes. Each curve is averaged 
over fifteen runs. The distributions are identical and they all practically overlap with the Poisson distribution, Eq. JHJ, with 
m = 5. Inset: The same distributions shown on a log-linear scale. The lines connecting the symbols are guides to the eye. 

and other transportation networks. To distinguish it from the more commonly studied "rich get richer" schemes that 
produce scale-free networks, we call it inverse preferential attachment. 

The outdegree distribution obtained using the generalized form of the probability of attachment, UQc^ki) oc 
(fc^/fci) 7 , (with 7 = 1) is broader than the one obtained from the simplified form, Il(fcj) oc 1/fej. However, both 
decay faster than exponentially for large fcj. The shape of the outdegree distribution is continuously tunable by 7, 
from a exponential distribution for 7 = to a delta function for 7 — > 00. The indegree distribution does not depend 
on 7, but only on m, the mean number of links per node. In the limit N 3> fc,m, the indegree distribution is a 
poissonian. 

Some features of the networks produced by this model, like the outdegree distribution, which decays faster than 
exponentially, resemble those of some empirical and model food webs US B El, EH- In this, they differ sha rply 
from the scale-free networks generated by the conventional "rich get richer" preferential-attachment schemes [ill Il6| . 
However, differences from real food webs remain, such as the correlation between the in- and outdegrees of a node. 
Our model produces webs with a positive in-outdegree correlation, whereas most empirical and model webs have a 
negative correlation (til. [l3l]. This may be due to the unrestrained growth of our networks, which would require an 
extinction process to achieve a steady state [l2| . Also, this growth scheme is not designed to produce loops. We intend 
to include such features in future versions of the model, thus enabling modeling of mature, steady-state networks. 
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